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Abstract —Low-rank matrix approximation (LRMA) is a pow¬ 
erful technique for signal processing and pattern analysis. How¬ 
ever, its potential for data compression has not yet been fully 
investigated in the literature. In this paper, we propose sparse low- 
rank matrix approximation (SLRMA), an effective computational 
tool for data compression. SLRMA extends the conventional 
LRMA by exploring both the intra- and inter-coherence of data 
samples simultaneously. With the aid of prescribed orthogonal 
transforms (e.g., discrete cosine/wavelet transform and graph 
transform), SLRMA decomposes a matrix into a product of 
two smaller matrices, where one matrix is made of extremely 
sparse and orthogonal column vectors, and the other consists 
of the transform coefficients. Technically, we formulate SLRMA 
as a constrained optimization problem, i.e., minimizing the 
approximation error in the least-squares sense regularized by ^o- 
norm and orthogonality, and solve it using the inexact augmented 
Lagrangian multiplier method. Through extensive tests on real- 
world data, such as 2D image sets and 3D dynamic meshes, we 
observe that (i) SLRMA empirically converges well; (ii) SLRMA 
can produce approximation error comparable to LRMA but in 
a much sparse form; (iii) SLRMA-based compression schemes 
significantly outperform the state-of-the-art in terms of rate- 
distortion performance. 

Index Terms —Data compression, optimization, low-rank ma¬ 
trix, orthogonal transform, sparsity 

1. Introduction 

G iven a matrix X G corresponding to n data 

samples in low-rank matrix approximation (LRMA) 
(a.k.a. principal component analysis (PCA) and subspace fac¬ 
torization) seeks a matrix X G of rank k min(m, n) 

that best approximates X in the least-squares sense ifH . 
Alternatively, the rank constraint can be implicitly expressed 
in a factored form, i.e., X ^ X = BC where B G and 

C G (see Figure [T]). This decomposition is not unique, 
since (BA)(A“^C) = BC holds for any invertible matrix 
A G To reduce the solution space, one often requires 

the decomposed matrix B to be column-orthogonal. 
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Fig. 1. Illustration of low-rank matrix approximation, k <C min(m, n) and 
= I/j.; Each column of X corresponds to one data sample. 

Since LRMA is able to reveal the inherent structure of 
the input data, it has been applied to a wide spectrum of 
engineering applications, including data compression El, ll^. 
background subtraction lH, O, classification and clustering 
lEl, El, image/video restoration and denoising El, (II, image 
alignment and interpolation ca, mil, structure from motion 
ca, Ha, ina, etc. We refer the readers to the survey paper 
ca for more details. 

Among the above-mentioned applications, we are partic¬ 
ularly interested in data compression. At the first glance, 
applying LRMA to data compression seems to be straight¬ 
forward, since one only needs to store k{m n) elements, 
with small approximation error introduced in LRMA. Such 
an idea has been used extensively to compress various types 
of data, e.g., images/videos El, CSl, Cll, EH, GSl, EOl . 
El, 3D motion data ED, ED, ED, ED, ESl . traffic data 
ED, EtI . ESl . However, data samples usually exhibit both 
/nfra-coherence (i.e., coherence within each data sample) and 
/nf^r-coherence (i.e., coherence among different data samples). 
LRMA can exploit the inter-coherence well, i.e., using B with 
much smaller orthogonal columns to represent X, but it fails to 
address the intra-coherence in the columns of B, and hereby 
compromises the compression performance ED, ED, ES, 
E6l . ED, ESl . Figure Eta) shows the problem using a 2D 
image set. One can clearly see that the columns of B produced 
by LRMA are locally smooth (the bright areas), indicating the 
strong coherence. 

In this paper, we propose sparse low-rank matrix approxima¬ 
tion (SLRMA) for data compression. In contrast to the existing 
methods, which usually explore intra- and inter-coherence 
separately (see Section HB, SLRMA is able to explore both 
the intra- and inter-coherence of data samples simultaneously 
from the perspective of optimization and transform. As Figure 
El shows, SLRMA multiplies a prescribed orthogonal matrix 
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^ G (such as discrete consine/wavelet transform 

(DCT/DWT) and graph transform) to the input matrix X and 
then factors into a product of the extremely sparse and 
column-orthogonal matrix B and the coefficient matrix C. 
We formulate SLRMA as a constrained optimization problem, 
i.e., minimizing the approximation error in least-squares sense 
under the ^o-norm and orthogonality constraints, and solve it 
using the inexact augmented Lagrangian multiplier method. 

Through extensive tests on real-world data, such as 2D 
image sets and 3D dynamic meshes, we observe that (i) 
SLRMA empirically converges well; (ii) SLRMA can produce 
comparable approximation error as LRMA but in a much 
sparser form; (iii) SLRMA-based compression schemes out¬ 
perform the state-of-the-art scheme to a large extent in terms of 
rate-distortion performance. Figure Hb) visualizes the column 
vectors of B of the proposed SLRMA which produces the 
same approximation error as that of LRMA in Figure EJa), 
where one can clearly see that column vectors of SLRMA do 
not exhibit such intra-coherence. 

The rest of this paper is organized as follows: Section [II| 
reviews the related work on LRMA-based data compression. 
Section |III| briefiy introduces LRMA and graph transform. 
Sections [IV] presents the sparse LRMA algorithm followed 
by experimental results in Section jV] Finally, Section [Vl| 
concludes this paper and points out several promising future 
directions. 

Notations. Throughout this paper, scalars are denoted by 
italic lower-case letters, vectors by bold lower-case letters and 
matrices by upper-case ones, respectively. For instance, we 
consider a matrix A G The i-ih row and j-th column of 

A are represented by a* G and G respectively. 

Let aij be the (i,jf)-th entry of A with its absolute being 
\aij\. A^ and A^ are the transpose and pseudoinverse of A. 

We denote by ||A||f = yjY7=i Ej=i and ||A||oo = 
maxij \aij\ the Frobenious norm and the loo norm of A, 
respectively. The ^o-norm || A||o counts the number of nonzero 
entries in A. Let Tr(A) = an be the trace of A. is the 
identity matrix of size k x k. 

II. Related Work 

In this section, we briefiy review the LRMA-based methods 
for data compression. Yang and Lu ilEl represented the basis 
vectors using vector quantization for image coding. Similar 
techniques were also adopted in video compression El, iini- 
Note that training coodbooks is computationally expensive due 
to the large amount of training data. Moreover, the trained 
codebooks may be biased, leading to large approximation 
error to the data if they are fundamentally different than the 
training data. Du et al. GSl applied LRMA to hyperspectral 
images to remove the spectral coherence, then adopted JP2K 
ll29l , the standard image encoder, to further compress the 
basis vectors. Hou et al. Ii] extracted a few keyframes from 
geometry videos and compressed them using a standard video 
encoder H.264/AVC (301, and the whole geometry video 
was reconstructed by linearly interpolating the decompressed 
keyframes. Chen et al. EH and Hou et al. ll2Ql also presented 
robust LRMA (H and video encoders combined compression 


frameworks for surveillance videos and geometry videos, 
respectively. 

3D triangle meshes are a simple and flexible tool to rep¬ 
resent complex 3D objects for graphics applications ranging 
from modeling, animation to rendering. A mesh consists 
of structural and geometric data. Dynamic meshes are the 
sequence of triangle meshes with the same connectivity, repre¬ 
senting a deformable model with time-varying geometry. Alex 
and Muller ED adopted LRMA to compress 3D dynamic 
meshes, representing each frame as the linear combination 
of few basis vectors. Kami and Gostamn EH improved this 
scheme by applying second-order linear predictive coding to 
coefficients to further remove the temporal coherence. Heu et 
al. ED proposed an adaptive bit plane coder to encode the 
basis vectors to achieve progressive transmission. However, 
these methods cannot eliminate the spatial (or intra-) coher¬ 
ence. Unlike ED, El, ED, trajectory-LRMA was employed 
in El, El, El, El, El, which represents trajectories of 
vertices as a linear combination of few basis vectors. Sattler et 
al. EH proposed clustered LRMA to compress 3D dynamic 
meshes, in which clustering of trajectories and trajectory- 
LRMA were performed simultaneously to explore the spa¬ 
tial and temporal correlation. Vasa et al. 13411 studied the 
trajectory-LRMA-based compression for 3D dynamic meshes 
systematically. For example, to remove the coherence among 
basis vectors, they proposed predictive coding to encode them. 
They also introduced three types of predictions to exploit 
the coherence among coefficients, namely parallelogram-based 
prediction 1^ . least squares prediction, and radial basis 
function-based prediction 1^ . Recently, they used the discrete 
geometric Laplacian of a computed average surface to encode 
the coefficients 1^ . and achieved the state-of-the-art rate- 
distortion performance. But this method requires the models 
of sequences to be manifolds due to the process of computing 
an average surface, limiting its range of applications. 


HI. Preliminaries 


A. Low-Rank Matrix Approximation 

Given X G the LRMA problem can be mathemati¬ 

cally formulated as 




IX-BCII 


subject to B^B = I*; 


( 1 ) 


Setting the derivative of the objective function of C to zero, 
we obtain C = B^X. Since ||A|||, = Tr(AA'^) and Tr(A) = 
Tr(A^), we can expand the objective function as 

||X-BCfF 

= Tr {(X - BC)(X - BC)"^} 

= Tr (XX”^) - 2Tr (bCX"^) + Tr (bCC'^B’^) . (2) 

Substituting C = B^X into E] and dropping the constant 
term, we obtain the equivalent problem of ([D, i e., 

maxTr 

subject to B'''B = I*;. (3) 
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(a) Column vectors of B in LRMA 








(b) Column vectors of B in SLRMA 


(c) Column vectors of B in Equation (13. 

Fig. 2. Comparison of LRMA and SLRMA in terms of intra-correlation in B. The test image set contains 150 frames of the “foreman” video of resolution 
88 X 72, representing in a matrix X G ^6336x150 samples are shown in Figure |5ja). We visualize the first five column vectors of B for LRMA and 

SLRMA in (a) and (b). Both LRMA and SLRMA approximate matrix X with the same approximation error. In SLRMA, we set ^ the 2D DCT matrix. 


It is well known that the problem in (|3l) has an optimal solution 
Ea, which consists of k eigenvectors of XX^, corresponding 
to the k largest eigenvalues. We refer the readers to 1^ for 
more technical details about LRMA. 


B. Graph Transform 

Let s G be a signal defined on an undirected, 

connected and unweighted graph with n vertices denoted by 
Q = (V, f), where V and £ are the sets of vertices and edges, 
respectively. The graph’s adjacency matrix E G is given 

by 


r 1 if (i, j) e £ 

f 0 otherwise, 


(4) 


and its degree matrix F e 

.r ■ ■ 

f,_, ^ J Ei=ieii lf^=J 
0 otherwise. 


fij — 


a diagonal matrix, is defined 

(5) 


Then the graph Laplacian matrix L G 


is computed as 


L = F-E. 


( 6 ) 


Since L is a real symmetric matrix, it has a set of real and 
non-negative eigenvalues denoted by - ,n associated 

with a complete set of orthogonal eigenvectors denoted by 
{Ui e 2 ,■■■,«, i-e., 

L = , (7) 

where \Jgt = [ui U2 • • • u^] G and A = 

diag(Ai,... ,An) 


Similar to the Fourier bases, which are the eigenvectors of 
the one-dimensional Laplace operator, the eigenvectors of L 
also possess harmonic behavior llJTl . Following (381, we call 
the eigenvector matrix based transformation graph transform 
(GT). Thus, one can decorrelate the signal as follows: 

r = UjtS, (8) 

where r G consists of sparse or approximately sparse 

transform coefficients. 

IV. Sparse Low-Rank Matrix Approximation 
A. Problem Statement 

Recall that LRMA can effectively exploit the inter- 
coherence of data samples, but it fails to exploit their intra- 
coherence. Thus, the intra-coherence is delivered into the 
columns of B. The issue to be addressed is how to effectively 
exploit such intra-coherence. 

It is well-known that some prescribed orthogonal trans¬ 
forms, e.g, DCT, DWT, and GT denoted sls ^ e 

= I^), can decorrelate signals (371, (121, 
i.e., producing approximately sparse transform coefficients. 
In order to explore the intra-coherence inheriting from the 
data samples, one may consider applying $ to columns of 
B following LRMA, which is similar to techniques in (T^ 
and 0, but such stepwise manner induces large reconstruc¬ 
tion error (See results in Section IB. The reasons are: (i) 
LRMA suppresses the spatial characteristics of data, so they 
cannot explore the intra-coherence very well in a direct way; 









4 



Fig. 3. Illustration of sparse LRMA. ^ is a prescribed orthogonal transform, such as DCT, DWT and GT, where each column corresponds to a frequency. 
The column vectors in matrix B are sparse and orthogonal, i.e., = 1/^. The empty boxes in B are zeros. 


(ii) columns of B are separately decorrelated, which cannot 
preserve the relationship among them (e.g., orthogonality). 
Therefore, we propose SLRMA to explore the intra- and inter¬ 
coherence simultaneously and integrally, and it is cast as the 
following optimization problem: 


min 

X-BC 

2 


B,C 


F 


subject to B^B = I 

k 


k 




0 



Note that we constrain B to be sparse with respect to 
making the derived B via optimization adapted to The 
difference between B and B by LRMA can be observed by 
comparing Figure [2ta) and Figure [2c). Furthermore, taking 
||$^A|||. = ||A|||. and with a slight abuse of notation (i.e, 
replacing $^B by B), we can rewrite the problem in (O as 

min ll^'^X - BCIll 

B,C 

subject to B^B = Ik and ||B||o < s. (10) 

In all, the proposed SLRMA can be interpreted as follows: 
the transform coefficients of data samples are low-rank approx¬ 
imated under the sparsity constraint, in which $ and the spar¬ 
sity constraint can explore the intra-coherence integrally, and 
the low-rank representation can explore the inter-coherence. 
These two types of decorrelation are simultaneously performed 
through optimization. Regarding it can be set according to 
data samples. For example, we can adopt DCT or DWT as 
the transform matrix for natural images, and the GT matrix 
for data defined on general graphs. 

B. Numerical Solver 

Similar to the process of o to (O, the problem in (llQb can 
be simplified as 

±3 

subject to B^B = Ik and ||B||o < s, (11) 

where Z = Instead of solving CHi directly, we solve 

its Lagrangian version: 

min -||ZTb||^+ 7 ||B||o 

subject to B^B = I/^, (12) 


where the regularization parameter 7 controls the sparsity of 
B: the larger the value of 7 is, the sparser the matrix B is. 

We adopt the inexact augmented Lagrangian multiplier 
method (lALM) ll40l to solve ([T2 . Introducing two auxiliary 
matrices P G and Q G we rewrite ([T2 

equivalently as 

-||ZTb|||+7||P||o 

subject to P = B, Q = B, Q^Q = 1/^. (13) 

The augmented Lagrangian form of (fTSl) is given by 
argmin-IIZ^BIll, + 7||P||o + Tr (y^B - P)) 

QTQ = Ifc 

+ Tr (y^B - Q)) + I (||B - Pill + ||B - Qlll), 

(14) 

where and G are the Lagrange multipliers, and 

the regularization parameter p is a positive scalar. 

With initialized P, Q, Yp, and Yg, the lALM solves 
the optimization problem (Hii in an iterative manner (see 
Algorithm d] for the pseudocode). Each iteration alternatively 
solves the following four subproblems: 

1) The 'B-Subproblem: The B-subproblem is with a 
quadratic form: 

mm -||ZTb||| + |(||B-P + Yp/p||| 

+ ||B-Q + Yq/p|||). (15) 

Equation (fTSl) reaches the minimal when the first-order deriva¬ 
tive to B vanishes: 

B = (2pl^ - 2ZZT) (p(P + Q) - Yp - Yq) . (16) 

2) The -Subproblem: The P-subproblem is written as 

mm7||P||o + |||P - (B + Yp/p)|||. (17) 

Let B = B + Yp/p, and Equation (fTTl) can be rewritten in 
element-wise as 

min 7I (p , ^0) + § (pij - ) , (18) 

2 \ / 

where l(p.^.^o) is a indicator function, i.e., l(p.^.^o) = 
if Pij 7^ 0 and 0 otherwise. Furthermore, the objective 
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function ([18]) is minimized if each independent univariate in 
the summarization is minimized, i.e., 

P f — \2 

mm7l(p,^^^o) + 2 [Pij “ h) ■ ( 19 ) 


It can be easily checked that pij has an unique solution given 
by the hard thresholding operator, i.e.. 


Pij = 


bij ii \bij\ > ^/¥f/p 
0 otherwise. 


( 20 ) 


3) The Q-Subproblem: It is expressed as 

min ^\\Q-(B + Yq/p)\\%. (21) 

Q Q=lfc ^ 

According to Theorem [T] the problem ([21]) has a closed-form 
solution: 

Q = (22) 

where B = B + Yg/p, and the orthogonal matrix V G 
and the diagonal matrix D G satisfy the eigendecom- 

position of B^B, i.e., B^B = VDV^. 

Theorem 1 (EB)' Given X G and rank(X) = n, 

the constrained quadratic problem: 

X* = argmin -||X — A|||., subject to X^X = 


has the closed-form solution, /.^.,X* = AVD“^/^V^, where 
V G is an orthogonal matrix and D G is a 

diagonal matrix satisfying the eigendecomposition of A. 

5) Updating Y p, Yg and p: Finally, we update the 
Largrange multipliers Yp and Yg, and the parameter p as 


Y^er+l _ ^ 23 ) 

Y^er+1 _ Y^ter ^24) 

p^^-+i=min(p^^-a,p^,,), (25) 


where the parameter a> 1 improves the convergence rate and 
iter is the iteration index. 

Lin et al. iQi proved that lALM guarantees convergence 
to an optimal solution on a convex optimization problem ||4l. 
However, the objective function in (fT^ and the orthogonality 
constraint are both non-convex. To the best of our knowledge, 
there is no theoretical evidence on the global convergence 
of lALM on a non-convex problem. Fortunately, thanks to 
the closed-form solution for each subproblem, we observe 
that SLRMA via lALM empirically converges well (i.e., the 
objective function is reduced to a stable value after a few 
iterations) and produces satisfactory performance on real- 
world datasets (see Section lYl). 


V. SLRMA-based Data Compression 

In this section, we develop and evaluate the SLRMA-based 
compression schemes for 3D dynamic meshes and 2D image 
sets consisting of facial images or frames of video with slow 
motions and nearly stationary background. These types of 
data exhibit intra- (or spatial) coherence. Besides, the inter¬ 
coherence among them makes the constructed matrices possess 
approximate low-rank characteristics. Thus, it is suitable to 
compress them using LRMA- and SLRMA-based methods. 


Algorithm 1 Computing SLRMA via lALM 
Input. X, k, p, (T, Pmax 

Output: B, C 

1: initialize P and Q using the leftmost k columns of 

2: Yp = Yg = 0 
3: Z = 

4: while not convergence do 
5: update B using (O 

6: update P using (l20b 

7: update Q using (l22l) 

8: update Yp, Yg and p using (|23]) -(I^. respectively 

9: check the convergence conditions 

||B - P||oo < 10-® and ||B - QIU < IQ-^ 

10: end while 

11: update C using B^Z 


C 



(a) 



(b) 


Fig. 4. The flowcharts of the proposed SLRMA-based compression schemes 
for image sets (a) and 3D dynamic meshes (b). 


A. Image Sets 

Given a grayscale image set with n images denoted by 
{Mi where w and h are the resolution of 

the images, respectively, we reshape each image as a column 
vector and stack them into a matrix denoted by X G 
As shown in Figure Oa), our SLRMA-based compression 
scheme is very simple. It decomposes X into an extremely 
sparse matrix B and a coefficients matrix C. They are then 
uniformly quantized before the nonzero elements, as well 
as their locations, are entropy-coded into a bitstream using 
arithmetic coding. 

We tested four grayscale image sets: Image set I: 150 facial 
images (resolution: 64 x 64) extracted from the AR dataset 
ll4^ : Image set II: 150 facial images (resolution: 65 x 75) 
extracted from the Fa of FERET dataset (431; Image set III: 
150 images (resolution: 88 x 72) extracted from the “ear¬ 
phone” video sequenceQ; Image set IV: 150 images (resolution: 

^ http://trace.eas.asu.edu/yuv/ 
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(a) Image set used in Figure [2] 


S B ^ H 


(b) Image set I 



(c) Image set II 

RBILRLBL 

(d) Image set III 



(f) 3D dynamic meshes 


Fig. 5. Examples of test datasets. 

88 X 72) extracted from the “hall” video sequence. Some data 
samples are respectively shown in Figures [3b)-(e). The values 
of p, ct, and pmax are set to 10“^, 1.05, and 10^^, respectively. 
Note that these parameters are insensitive to the size of image 
sets. The percentage of zero elements of B is denoted by 
and various is obtained by adjusting 7 . 

Firstly, we evaluate the convergence and low-rank ap¬ 
proximation performance of SLRMA under various pb, i.e., 
the relationship among approximation error, k and pb- The 
approximation error is measured by the Root Mean Square 
Error J^RMSE) between the original data X and approximate 
data X, i.e.. 


/ 1 


2 

RMSE = W- 

X-X 


V mn 


f' 


The SLRAM is validated under two cases, which are denoted 
by: (i) SLRMA (DCT) in which $ is set as the 2D DCT 
matrix, i.e., the Kronecker product of two ID DCT matrices; 
(ii) SLRMA (DWT) in which ^ is realized by the 2D DWT 
matrix obtained as the Kronecker product of two multi-level 
Haar matrices (3-level for image sets I and II, and 4-level for 
image sets III and IV). As a comparison, we also present the 
performance of stepwise methods, namely LRMA-DCT (resp. 
LRMA-DWT), in which the same DCT (resp. DWT) matrix 


as SLRMA is applied to columns of B obtained by LRMA, 
and then pb percentage of all transformed coefficients with 
smallest magnitude are set to zero before carrying out the 
inverse transform. 

As the first row of Figure [ 6 ] shows, SLRMA empirically 
converges well, i.e., the objective function is reduced to a 
stable value after a few iterations. The second and forth 
rows of Figure [ 6 | verify the excellent low-rank approximation 
performance of SLRMA, that is, at the same k, the approx¬ 
imation error by SLRMA is comparable to that by LRMA, 
which is the lower bound. Moreover, SLRMA (DWT) is better 
than SLRMA (DCT) on image sets III and IV since the 
multi-level DWT has greater potential for sparsly representing 
images with complex textures than DCT ll44ll . For image 
sets I and II, which contain relatively smooth facial images, 
both DWT and DCT can decorrelate the data well, so that 
the difference between SLRMA (DWT) and SLRMA (DCT) 
is very slight. However, the stepwise methods LRMA-DCT 
and LRMA-DWT, shown in the third and fifth rows, induce 
much larger approximation error than SLRMA at the same k 
and Pb- Finally, we show the rate-distortion performance of 
the SLRMA-based compression scheme in the sixth row of 
Figure [ 6 l where we can see that the SLRMA-based method 
consistently produces higher peak-signal-noise-ratio (PSNR) 
at the same bit per pixel (bpp). The improvement is up to 
3 dB compared to the method in UHl, namely LRMA-JP2K, 
in which the DWT-based JP2K image encoder is employed 
to encode the basis vectors after LRMA is applied. Figure [7] 
also compares visual results by SLRMA and LRMA-JP2K, 
where we can observe that areas around persons marked out 
by red ovals are obviously blurred for LRMA-JP2K images, 
but the corresponding images by SLRMA are sharper and 
visually closer to original images at the same bpp. Last but not 
least, we believe that our SLRMA-based compression scheme 
can achieve higher rate-distortion performance by adopting 
more advanced entropy coding, e.g., context-adaptive binary 
arithmetic coding (CABAC) Ea and embedded block coding 
with optimal truncation (EBCOT) 1291 . 


B. 3D Dynamic Meshes 

Given a 3D dynamic mesh sequence with m vertices and n 
frames, it can be represented as three matrices X^^ G 
X^ G ^ corresponding to the x—, y—, 

and z—dimensions of vertices, respectively, and each column 
of {d := {x^y^z}) corresponds to the d—dimension 
of vertices of one frame. The SLRMA-based compression 
scheme is shown in Figure lUb), where X^^ is firstly factored 
into ^d and using SLRMA. Different from image sets, 
3D dynamic mesh sequences consist of successive frames of 
objects in motion, so that each row of X^, corresponding to the 
d-dimensional trajectory of one vertex, is a relatively smooth 
curve, indicating strong coherence. After the SLRMA, such 
coherence still exists in rows of l22l . For example. Figure 
[ 8 ] plots several rows of where their relative smoothness 
is verified. This can be easily explained using the analysis for 
intra-correlation of B of LRMA along the row space of X. To 
further reduce this temporal coherence, ID DCT is separately 
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(a) Image set I 


(b) Image set II 


(c) Image set III 


(d) Image set IV 


Fig. 6. Evaluation of SLRMA on image sets. row: convergence verification of SLRMA (pB = 0-6 and k = 30); 2’^^row: low-rank approximation 
performance of SLRMA (DCT) under various pb\ row: performance of the stepwise method LRMA-DCT under various pb', 4^^ row: low-rank 
approximation performance of SLRMA (DWT) under various pB', 5^^ row: performance of the stepwise method LRMA-DWT under various pB', 6^^ row: 
comparison of rate-distortion performance of SLRMA-based and LRMA-JPEG2K, and ^ is realized by DWT; Note that rows 2-5 share the same legend. 











































































































(a) 



Fig. 7. Comparison of visual results on image set. (a) original images from the Image set IV. (b) and (c) compressed ones by the LRMA-JP2K method 
under 0.38 bpp and 0.21 bpp, respectively; (d) and (e) compressed ones by our SLRMA-based method under 0.38 bpp and 0.21 bpp, respectively. 



Fig. 8. Illustration of the coherence within rows of 


performed on rows of i.e., 

Cd = ULcy (27) 

where \Jdct ^ stands for the ID DCT matrix, and 

each column corresponds to one frequency. Lastly, the nonzero 
entries of uniformly quantized and as well as their 
locations are entropy-coded using arithmetic coding. 

We take four 3D dynamic mesh sequences from ll46ll . in¬ 
cluding “Wheel” (2501 vertices), “Handstand” (2501 vertices), 
“Skirt” (2502 vertices), and “Dance” (1502 vertices). Each 
sequence has 150 frames. Some samples are shown in Figure 
[3f). The values of p, a, and pmax are set to 10^, 1.003, and 
10^^, respectively. The orthogonal matrix ^ is realized by 
the GT, as explained in Section IIII-Bi and can be computed 
according to the topology of 3D meshes. Note that the three 
dimensions are equally treated, i.e., the same k and pb are 
used. 

The empirical convergence of SLRMA is verified again 
in the first row of Figure [9j The low-rank approximation 
performance of SLRMA is shown in the second row of Figure 
O where we can see that at the same k, SLRMA produces 
comparable RMSEs as LRMA even when the value of ps 
increases up to 0.8. However, the stepwise method denoted 
by LRMA-GT produces much larger RMSEs. See the third 
row of Figure [9l Note that X = ^ M^mxn 


and X = [X.x]^y]^z] ^ ]^3mxn computing the 

approximation error using ([26l) . 

Finally, we evaluate the rate-distortion performance of the 
SLRMA-based compression method. The compression distor¬ 
tion is measured by the widely-adopted KG error (221, defined 
as 


KG error = 100 x 


X-X 


F 


l|X-^(X)||^’ 


(28) 


where E{X.) is an average matrix of the 

same size as X, of which the j-th column is 
••• 1] (X 3 ;)j[l ••• 1] (x^)j[l ••• l])"^ with 
(xc^)y being the average of (xc^)y. The bitrate is measured 
in bit per frame per vertex (bpfv). The overall compression 
performance of our scheme is affected by three parameters: 
k (the number of basis vectors), pb (the percentage of zero 
elements of B), and the quantization parameter. Currently, we 
use exhaustive search to determine their optimal combination. 
One can also employ the method in El or build rate and 
distortion models in terms of the three parameters, like O, 
to speed up this process in practice. We compare with Vasa’s 
scheme in ED, which is LRMA-based and represents the 
state-of-the-art performance. 

The forth row of Figure [9] shows typical rate-distortion 
curves for our scheme as well as Vasa’s, where it can be 
seen that for “Dance” and “Skirt”, our method produces much 
smaller distortion than Vasa’s at the same bpfv, especially 
at relatively small bpfvs. With respect to “Handstand” and 
“Wheel”, the rate-distortion curves of Vasa’s are not given 
since Vasa’s method can only work on manifold-meshes, 
but sequences “Handstand” and “Wheel” consists of non¬ 
manifold-meshes. Our scheme is independent of topology, 
which is an additional advantage. Finally, some visual results 
are shown in Figures [T0| and [TT] to further demonstrate the 
performance of our scheme, where we can observe that the 
decompressed frames are still close to the original ones and 
their quality are reasonable even when the bpfv is equal to 
0.25. 
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(a) “Dance” 


(b) “Handstand” 


(c) “Skirt” 


(d) “Wheel” 


Fig. 9. Evaluation of SLRMA on 3D dynamic meshes. row: the convergence verification of SLRMA (pB = 0.6 and k = 30); 2'^^ row: low-rank 
approximation performance of SLRMA (GT) at various ps', row: performance of the stepwise method LRMA-GT at various pb', 4^^ row: rate-distortion 
performance of our SLRMA-based method and the state-of-the-art method. Note that rows 2-3 share the same legend. 


VI. Conclusion and Future Work 

We presented sparse low-rank matrix approximation for 
effective data compression. In contrast to the conventional 
LRMA, SLRMA is able to explore both the intra- and inter¬ 
coherence among data simultaneously, producing extremely 
sparse basis functions. We formulated the SLRMA problem 
as a constrained optimization problem and solved it using the 
inexact augmented Lagrangian multiplier method. Although 
the optimization problem is non-convex, we observed that 
our method empirically converges well on real-world data, 
such as image sets and 3D dynamic meshes. Also, SLRMA 
exhibits excellent low-rank approximation performance, i.e., 
at the same rank, comparable approximation error as LRMA 
is produced even when 80% entries of basis vectors are zero. 
Moreover, at the same bitrate, the SLRMA-based compression 
schemes can reduce distortion by up to 53% for 3D dynamic 
meshes and improve PSNR by up to 3 dB for image sets 
compared with existing methods. 

In the future, we would like to further investigate the 


potential of SLRMA to compress other types of data (e.g., 
EEG signals). We also believe that SLRMA can be applied 
to other applications, such as patch-based image and video 
denoising, where both the similarity among patches (i.e., low- 
rank characteristic) and spatial structure of patches (i.e., the 
sparseness of patches with respect to particular bases) can be 
jointly taken into account. 
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